SEQUENCE LISTING

(1) GENERAL INFORMATION 

(i) APPLICANT: Hodgson, John

Lawlor, Elizabeth 

(ii) TITLE OF THE INVENTION: Novel tRNA Synthetase 



(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 17-JAN-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 9601095.4 

(B) FILING DATE: 19-JAN-1996 

(A) APPLICATION NUMBER: 9615845.6 

(B) FILING DATE: 27-JUL-1996 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Gimmi, Edward R

(B) REGISTRATION NUMBER: 38,891 

(C) REFERENCE/DOCKET NUMBER: P31353
(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1974 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



ATGGCTAAAG AAACATTTTA TATAACAACC CCAATATACT ATCCTAGTGG GAATTTACAT 60

ATAGGACATG CATATTCTAC AGTGGCTGGA GATGTTATTG CAAGATATAA GAGAATGCAA 120

GGATATGATG TTCGTTATTT GACTGGAACG GATGAACACG GTCAAAAAAT TCAAGAAAAA 180

GCTCAAAAAG CTGGTAAGAC AGAAATTGAA TATTTGGATG AGATGATTGC TGGAATTAAA 240

CAATTGTGGG CTAAGCTTGA AATTTCAAAT GATGATTTTA TCAGAACAAC TGAAGAACGT 300

CATAAACATG TCGTTGAGCA AGTGTTTGAA CGTTTATTAA AGCAAGGTGA TATCTATTTA 360

GGTGAATATG AAGGTTGGTA TTCTGTTCCG GATGAAACAT ACTATACAGA GTCACAATTA 420

GTAGACCCAC AATACGAAAA CGGTAAAATT ATTGGTGGCA AAAGTCCAGA TTCTGGACAC 480

GAAGTTGAAC TAGTTAAAGA AGAAAGTTAT TTCTTTAATA TTAGTAAATA TACAGACCGT 540

TTATTAGAGT TCTATGACCA AAATCCAGAT TTTATACAAC CACCATCAAG AAAAAATGAA 600

ATGATTAACA ACTTCATTAA ACCAGGACTT GCTGATTTAG CTGTTTCTCG TACATCATTT 660

AACTGGGGTG TCCATGTTCC GTCTAATCCA AAACATGTTG TTTATGTTTG GATTGATGCG 720

TTAGTTAACT ATATTTCAGC ATTAGGCTAT TTATCAGATG ATGAGTCACT ATTTAACAAA 780

TACTGGCCAG CAGATATTCA TTTAATGGCT AAGGAAATTG TGCGATTCCA CTCAATTATT 840

TGGCCTATTT TATTGATGGC ATTAGACTTA CCGTTACCTA AAAAAGTCTT TGCACATGGT 900

TGGATTTTGA TGAAAGATGG AAAAATGAGT AAATCTAAAG GTAATGTTGT AGACCCTAAT 960

ATTTTAATTG ATCGCTATGG TTTAGATGCT ACACGTTATT ATCTAATGCG TGAATTACCA 1020

TTTGGTTCAG ATGGCGTATT TACACCTGAA GCATTTGTTG AGCGTACAAA TTTCGATCTA 1080

GCAAATGACT TAGGTAACTT AGTAAACCGT ACGATTTCTA TGGTTAATAA GTACTTTGAT 1140

GGCGAATTAC CAGCGTATCA AGGTCCACTT CATGAATTAG ATGAAGAAAT GGAAGCTATG 1200

GCTTTAGAAA CAGTGAAAAG CTACACTGAA AGCATGGAAA GTTTGCAATT TTCTGTGGCA 1260

TTATCTACGG TATGGAAGTT TATAAGTAGA ACGAATAAGT ATATTGACGA AACAACGCCT 1320

TGGGTATTAG CTAAGGACGA TAGCCAAAAA GATATGTTAG GCAATGTAAT GGCTCACTTA 1380

GTTGAAAATA TTCGTTATGC AGCTGTATTA TTACGTCCAT TCTTAACACA TGCGCCGAAA 1440

GAGATTTTTG AACAATTGAA CATAAACAAT CCTCAATTTA TGGAATTTAG TAGTTTAGAG 1500

CAATATGGTG TGCTTACTGA GTCAATTATG GTTACTGGGC AACCTAAACC TATTTTCCCA 1560

AGATTGGATA GCGAAGCGGA AATTGCATAT ATCAAAGAAT CAATGCAACC GCCTGCTACT 1620

GAAGAGGAAA AAGAAGAGAT TCCTAGCAAA CCTCAAATTG ATATTAAAGA CTTTGATAAA 1680

GTTGAAATTA AGGCAGCAAC GATTATTGAT GCTGAACATG TTAAGAAGTC AGATAAGCTT 1740 

TTAAAAATTC AAGTAGACTT AGATTCTGAA CAAAGACAAA TTGTATCAGG AATTGCCAAA 1800 

TTCTATACAC CAGATGATAT TATTGGTAAA AAAGTAGCAG TTGTTACTAA CCTGAAACCA 1860 

GCTAAATTAA TGGGACAAAA ATCTGAAGGT ATGATATTAT CTGCTGAAAA AGATGGTGTA 1920 

TTAACCTTAG TAAGTTTACC AAGTGCAATT CCAAATGGTG CAGTGATTAA ATAA 1974 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Lys Glu Thr Phe Tyr Ile Thr Thr Pro Ile Tyr Tyr Pro Ser
1               5                   10                  15

Gly Asn Leu His Ile Gly His Ala Tyr Ser Thr Val Ala Gly Asp Val
            20                  25                  30

Ile Ala Arg Tyr Lys Arg Met Gln Gly Tyr Asp Val Arg Tyr Leu Thr
        35                  40                  45

Gly Thr Asp Glu His Gly Gln Lys Ile Gln Glu Lys Ala Gln Lys Ala
    50                  55                  60

Gly Lys Thr Glu Ile Glu Tyr Leu Asp Glu Met Ile Ala Gly Ile Lys
65                  70                  75                  80

Gln Leu Trp Ala Lys Leu Glu Ile Ser Asn Asp Asp Phe Ile Arg Thr
                85                  90                  95

Thr Glu Glu Arg His Lys His Val Val Glu Gln Val Phe Glu Arg Leu
            100                 105                 110

Leu Lys Gln Gly Asp Ile Tyr Leu Gly Glu Tyr Glu Gly Trp Tyr Ser
        115                 120                 125

Val Pro Asp Glu Thr Tyr Tyr Thr Glu Ser Gln Leu Val Asp Pro Gln
    130                 135                 140

Tyr Glu Asn Gly Lys Ile Ile Gly Gly Lys Ser Pro Asp Ser Gly His
145                 150                 155                 160

Glu Val Glu Leu Val Lys Glu Glu Ser Tyr Phe Phe Asn Ile Ser Lys
                165                 170                 175

Tyr Thr Asp Arg Leu Leu Glu Phe Tyr Asp Gln Asn Pro Asp Phe Ile
            180                 185                 190

Gln Pro Pro Ser Arg Lys Asn Glu Met Ile Asn Asn Phe Ile Lys Pro
        195                 200                 205

Gly Leu Ala Asp Leu Ala Val Ser Arg Thr Ser Phe Asn Trp Gly Val
    210                 215                 220

His Val Pro Ser Asn Pro Lys His Val Val Tyr Val Trp Ile Asp Ala
225                 230                 235                 240

Leu Val Asn Tyr Ile Ser Ala Leu Gly Tyr Leu Ser Asp Asp Glu Ser
                245                 250                 255

Leu Phe Asn Lys Tyr Trp Pro Ala Asp Ile His Leu Met Ala Lys Glu
            260                 265                 270

Ile Val Arg Phe His Ser Ile Ile Trp Pro Ile Leu Leu Met Ala Leu
        275                 280                 285

Asp Leu Pro Leu Pro Lys Lys Val Phe Ala His Gly Trp Ile Leu Met
    290                 295                 300

Lys Asp Gly Lys Met Ser Lys Ser Lys Gly Asn Val Val Asp Pro Asn
305                 310                 315                 320

Ile Leu Ile Asp Arg Tyr Gly Leu Asp Ala Thr Arg Tyr Tyr Leu Met
                325                 330                 335

Arg Glu Leu Pro Phe Gly Ser Asp Gly Val Phe Thr Pro Glu Ala Phe
            340                 345                 350

Val Glu Arg Thr Asn Phe Asp Leu Ala Asn Asp Leu Gly Asn Leu Val
        355                 360                 365

Asn Arg Thr Ile Ser Met Val Asn Lys Tyr Phe Asp Gly Glu Leu Pro
    370                 375                 380

Ala Tyr Gln Gly Pro Leu His Glu Leu Asp Glu Glu Met Glu Ala Met
385                 390                 395                 400

Ala Leu Glu Thr Val Lys Ser Tyr Thr Glu Ser Met Glu Ser Leu Gln
                405                 410                 415

Phe Ser Val Ala Leu Ser Thr Val Trp Lys Phe Ile Ser Arg Thr Asn
            420                 425                 430

Lys Tyr Ile Asp Glu Thr Thr Pro Trp Val Leu Ala Lys Asp Asp Ser
        435                 440                 445

Gln Lys Asp Met Leu Gly Asn Val Met Ala His Leu Val Glu Asn Ile
    450                 455                 460

Arg Tyr Ala Ala Val Leu Leu Arg Pro Phe Leu Thr His Ala Pro Lys
465                 470                 475                 480

Glu Ile Phe Glu Gln Leu Asn Ile Asn Asn Pro Gln Phe Met Glu Phe
                485                 490                 495

Ser Ser Leu Glu Gln Tyr Gly Val Leu Thr Glu Ser Ile Met Val Thr
            500                 505                 510

Gly Gln Pro Lys Pro Ile Phe Pro Arg Leu Asp Ser Glu Ala Glu Ile
        515                 520                 525

Ala Tyr Ile Lys Glu Ser Met Gln Pro Pro Ala Thr Glu Glu Glu Lys
    530                 535                 540

Glu Glu Ile Pro Ser Lys Pro Gln Ile Asp Ile Lys Asp Phe Asp Lys
545                 550                 555                 560

Val Glu Ile Lys Ala Ala Thr Ile Ile Asp Ala Glu His Val Lys Lys
                565                 570                 575

Ser Asp Lys Leu Leu Lys Ile Gln Val Asp Leu Asp Ser Glu Gln Arg
            580                 585                 590

Gln Ile Val Ser Gly Ile Ala Lys Phe Tyr Thr Pro Asp Asp Ile Ile
        595                 600                 605

Gly Lys Lys Val Ala Val Val Thr Asn Leu Lys Pro Ala Lys Leu Met
    610                 615                 620

Gly Gln Lys Ser Glu Gly Met Ile Leu Ser Ala Glu Lys Asp Gly Val
625                 630                 635                 640

Leu Thr Leu Val Ser Leu Pro Ser Ala Ile Pro Asn Gly Ala Val Ile
                645                 650                 655

Lys